RESULT 1 
T14271 

Doc4 protein, stress-induced - mouse 
N; Alternate names: odz protein homolog 
C; Species: Mus musculus (house mouse) 

C;Date: 20-Sep-1999 #sequence_revision 20-Sep-1999 #text_change 20-Sep-1999 

C;Accession: T14271 

R;Wang, X.Z.; Kuroda, M.; Sok, J.; Batchvarova, N.; Kimmel, R.; Chung, P.; Zinszner, H.; Ron, D. 
EMBO J. 17, 3619-3630, 1998 

A; Title: Identification of novel stress-induced genes downstream of chop. 
A; Reference number: Z17951; MUID: 98315054; PMID: 9649432 
A; Accession: T14271 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-2825 <WAN> 

A; Cross-references: EMBL:AF059485; NID:g3170614; PID:g3170615; PIDN:AAC31807.1 
C; Genetics: 
A; Gene: Doc4 
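
The annotation lines above follow a "tag;key: value" layout, so they can be split mechanically. The following is a minimal sketch, assuming that layout holds for every annotation line; the names `parse_records` and `split_xrefs` are hypothetical helpers introduced here for illustration and are not part of any PIR tool.

```python
import re

# Assumed layout: one uppercase tag letter, ';', a key, ':', then the value.
RECORD = re.compile(r"^([A-Z])\s*;\s*([^:]+?)\s*:\s*(.*)$")

SAMPLE = """\
C; Species: Mus musculus (house mouse)
C;Accession: T14271
A; Molecule type: mRNA
A; Cross-references: EMBL:AF059485; NID:g3170614; PID:g3170615; PIDN:AAC31807.1
A; Gene: Doc4
"""

def parse_records(text):
    """Return (tag, key, value) tuples for lines matching the assumed layout."""
    records = []
    for line in text.splitlines():
        m = RECORD.match(line.strip())
        if m:
            tag, key, value = m.groups()
            # Collapse runs of whitespace inside the key.
            records.append((tag, " ".join(key.split()), value.strip()))
    return records

def split_xrefs(value):
    """Split 'DB:id; DB:id' into a dict keyed by database name."""
    xrefs = {}
    for part in value.split(";"):
        db, sep, ident = part.strip().partition(":")
        if sep:
            xrefs[db.strip()] = ident.strip()
    return xrefs

records = parse_records(SAMPLE)
xrefs = split_xrefs(records[3][2])
print(records[0])
print(xrefs["EMBL"])
```

Lines without a colon, such as the author list, are skipped by this sketch rather than parsed.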

Query Match 96.6%; Score 14306; DB 2; Length 2825; 

Best Local Similarity 94.0%; Pred. No. 0; 

Matches 2655; Conservative 47; Mismatches 51; Indels 72; Gaps 2; 
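
The summary counts can be cross-checked against the reported similarity. The sketch below takes 2655 as the match count and assumes that Best Local Similarity is matches divided by the total number of alignment columns (matches + conservative + mismatches + indels). The report itself does not state that definition, so treat it as an assumption; `parse_counts` is a hypothetical helper.

```python
import re

# Summary line as read from the report, with 2655 labelled as the match count.
STATS = "Matches 2655; Conservative 47; Mismatches 51; Indels 72; Gaps 2;"

def parse_counts(line):
    """Map each 'Name N;' pair on the summary line to an integer."""
    return {name: int(value) for name, value in re.findall(r"([A-Za-z]+)\s+(\d+);", line)}

counts = parse_counts(STATS)

# Assumed definition: every aligned column is one of these four kinds.
columns = (counts["Matches"] + counts["Conservative"]
           + counts["Mismatches"] + counts["Indels"])
identity = 100.0 * counts["Matches"] / columns

print(columns)             # total alignment columns
print(round(identity, 1))  # percent identity under the assumed definition
```

Under this assumption the column total is 2825, which equals the reported length, and the identity rounds to the 94.0% shown as Best Local Similarity.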

MDVKERKPYRSLTRRRDAERRYTSSSADSEEGKAPQKSYSSSETLKAYDQDARLAYGSRV 60 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIII {Mlllllllllllllllllllllll 

MDVKERKPYRSLTRRRDAERRYTSSSADSEEGKGPQKSYSSSETLKAYDQDARLAYGSRV 60 
KDIVPQEAEEFCRTGANFTLRELGLEEVTPPHGTLYRTDIGLPHCGYSMGAGSDADMEAD 120 

Ihllllllllllll IIIIIMII hlllllllllllllllllllllll lllhlll 

KDMVPQEAEEFCRTGTNFTLRELGLGEMTPPHGTLYRTDIGLPHCGYSMGASSDADLEAD 120 
TVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENTET 164 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMII 

TVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENTETGAPLHCSSASSTPIEQ 180 
-DHPGGL 170 
III I 



QNHARLRTPPPPLSHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTDHSLSGEPPAGGAQ 230 

III lllllllll lllllllllllllllllllllllllllllllllllllllllll II 

QNHPRLRTPPPPLPHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTDHSLSGEPPAGSAQ 300 

EPAHAQENWLLNSNIPLETRNLGKQPFLGTLQDNLIEMDILGASRHDGAYSDGHFLFKPG 290 

II llhlhlll IhllllllllllMI lllllllll III lllllllll INI 

EPTHAQDNWVLNSKIPVETRNLGKQPFLGTWQDNLIEMDIFSASRRDGAYSDGHFFFKPG 360 

GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFARPAFNLKKPSKYCNWKCAALSAIVIS 350 

llllllllllllllllllllllllllllllllhllllllllllllllllllllllhll 

GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFNLKKPSKYCNWKCAALSAILIS 420 

ATLVILLAYFVAMHLFGLNWHLQPMEG--QMYEITEDTASSWPVPTDVSLYPSGGTGLET 408 

IIIMIIIIIIIIIIIIIIIIMIIII lllllllllllllllllllllllllllllll 

ATLVILLAYFVAMHLFGLNWHLQPMEGQMQMYEITEDTASSWPVPTDVSLYPSGGTGLET 480 

PDRKGKGTTEGKPSSFFPEDSFIDSGEIDVGRRASQKIPPGTFWRSQVFIDHPVHLKFNV 468 

lllllll llllll llllllllllllllllllllllllllllllllllllllllllll 

PDRKGKGAAEGKPSSLFPEDSFIDSGEIDVGRRASQKIPPGTFWRSQVFIDHPVHLKFNV 540 

SLGKAALVGIYGRKGLPPSHTQFDFVELLDGRRLLTQEARSLEGTPRQSRGTVPPSSHET 528 

IIIIIIIIIIIIIIIIIMIII llllllllllilMIIIIIII Mill llllllll 

SLGKAALVGIYGRKGLPPSHTQLDFVELLDGRRLLTQEARSLEGPQRQSRGPVPPSSHET 600 

GFIQYLDSGIWHLAFYNDGKESEWSFLTTAIESVDNCPSNCYGNGDCISGTCHCFLGPL 588 

IIMIIIIIIIIIIIIIIIIIIIIIilllMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GFIQYLDSGIWHLAFYNDGKESEWSFLTTAIESVDNCPSNCYGNGDCISGTCHCFLGFL 660 

GPDCGRASCPVLCSGNGQYMKGRCLCHSGWKGAECDVPTNQCIDVACSNHGTCITGTCIC 648 

IIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIhlllll lllll 

GPDCGRASCPVLCSGNGQYMKGRCLCHSGWKGAECDVPTNQCIDVACSSHGTCIMGTCIC 720 
NPGYKGESCEEVDCMDPTCSGRGVCVRGECHCSVGWGGTNCETPRATCLDQCSGHGTFLP 708 

IIIIIIIMIIIIIIIIIII IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIi 
NPGYKGESCEEVDCMDPTCSSRGVCVRGECHCSVGWGGTNCETPRATCLDQCSGHGTFLP 780 

DTGLCSCDPSWTGHDCSIEICAADCGGHGVCVGGTCRCEDGWMGAACDQRACHPRCAEHG 768 

llllhlMIIIIMIIIMIIIIMIIIIIIIIilllllllllllllllllMilllli 

DTGLCNCDPSWTGHDCSIEICAADCGGHGVCVGGTCRCEDGWMGAACDQRACHPRCAEHG 840 

TCRDGKCECSPGWNGEHCTIEGCPGLCNGNGRCTLDLNGWHCVCQLGWRGAGCDTSMETA 828 

IIIIMMhIIIIIMIIIIIMIMMIIIIIIIIIIIIIIIIIIIIi lllillll 

TCRDGKCECTPGWNGEHCTIEGCPGLCNGNGRCTLDLNGWHCVCQLGWRGTGCDTSMETG 900 

CGDSKDNDGDGLVDCMDPDCCLQPLCHINPLCLGSPNPLDIIQETQVPVSQQNLHSFYDR 888 

III IIIMMIIIIMIIIIIIIIIhlllllllhlllllllM llllllhlllll 

CGDGKDNDGDGLVDCMDPDCCLQPLCHVNPLCLGSPDPLDIIQETQAPVSQQNLNSFYDR 960 

IKFLVGRDSTHIIPGENPFDGGHACVIRGQVMTSDGTPLVGVNISFVNNPLFGYTISRQD 948 

IIIIIIIMII IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIhlllllllMIIII 

IKFLVGRDSTHSIPGENPFDGGHACVIRGQVMTSDGTPLVGVNISFINNPLFGYTISRQD 1020 
GSFDLVTNGGISIILRFERAPFITQEHTLWLPWDRFFVMETIIMRHEENEIPSCDLSNFA 1008 

IIIIIIIMIIIIIIIIIIIIIMIMMIMIIIIIMIIhllllllllll mill 

GSFDLVTNGGISIILRFERAPFITQEHTLWLPWDRFFVMETIVMRHEENEIPSRDLSNFA 1080 
RPNPWSPSPLTSFASSCAEKGPIVPEIQALQEEISISGCKMRLSYLSSRTPGYKSVLRI 1068 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII hllllllllllllllllllhll 

RPNPWSPSPLTSFASSCAEKGPIVPEIQALQEEIVIAGCKMRLSYLSSRTPGYKSWRI 1140 

SLTHPTIPFNLMKVHLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQKVFGLSEAFVS 1128 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllliill 
SLTHPTIPFNLMKVHLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQKVFGLSEAFVS 1200 

VGYEYESCPDLILWEKRTTVLQGYEIDASKLGGWSLDKHHALNIQSGILHKGNGENQFVS 1188 

llllllllilllllllll lllllllllllllllllllllllllllllllllllllllll 
VGYEYESCPDLILWEKRTAVLQGYEIDASKLGGWSLDKHHALNIQSGILHKGNGENQFVS 1260 

QQPPVIGSIMGNGRRRSISCPSCNGLADGNKLLAPVALTCGSDGSLYVGDFNYIRRIFPS 1248 

lllllllllillllllllllllllllllllllllllllllllllllllllllllllllM 

QQPPVIGSIMGNGRRRSISCPSCNGLADGNKLLAPVALTCGSDGSLYVGDFNYIRRIFPS 1320 

GNVTNILELSHSPAHKYYLATDPMSGAVFLSDSNSRRVFKIKSTVWKDLVKNSEWAGT 1308 

lllllllhllllllilllllllllllllllhllllllhlll lllllllllllllll 

GNVTNILEMSHSPAHKYYLATDPMSGAVFLSDTNSRRVFKVKSTTWKDLVKNSEWAGT 1380 

GDQCLPFDDTRCGDGGKATEATLTNPRGITVDKFGLIYFVDGTMIRRIDQNGIISTLLGS 1368 

MIIIMIIMIIIIIIIIIIMIIIillllllllllllMlllllhllllllllllll 

GDQCLPFDDTRCGDGGKATEATLTNPRGITVDKFGLIYFVDGTMIRRVDQNGIISTLLGS 1440 
NDLTSARPLSCDSVMDISQVHLEWPTDLAINPMDNSLYVLDNNWLQISENHQVRIVAGR 1428 

INIIMIIIMIIhllll IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 

NDLTSARPLSCDSVMEISQVRLEWPTDLAINPMDNSLYVLDNNWLQISENHQVRIVAGR 1500 

PMHCQVPGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTTSGEIS 1488 

llllllllll lllllllllllllllllllllllllllllllllllllllllllllllll 
PMHCQVPGIDQFLLSKVAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTTSGEIS 1560 

LVAGAPSGCDCKNDANCDCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIR 1548 

lllllllllllllllllllillllllllllllllllllllllllllllllllllllllll 
LVAGAPSGCDCKNDANCDCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIR 1620 

KNKPFLNTQNMYELSSPIDQELYLFDTTGKHLYTQSLPTGDYLYNFTYTGDGDITLITDN 1608 

llllillllillllllllllllllllhlllllllllllllllllllllllllll MM 

KNKPFLNTQNMYELSSPIDQELYLFDTSGKHLYTQSLPTGDYLYNFTYTGDGDITHITDN 1680 

NGNIWNTORDSTGMPLWLWPDGQVYWVTMGTNSALKSVTTQGHELAMMTYHGNSGLLAT 1668 

II II II II 1 1 II II I II M I M II I M II II II II I : I II II I II II II II II M M I II 

NGNMVNVRRDSTGMPLWLWPDGQVYWVTMGTNSALRSVTTQGHELAMMTYHGNSGLLAT 1740 

KSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSA 1728 

llllllllllllllllllllllllllllllllllllllllllllllllllllillhll 

KSNENGWTTPYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTHLSG 1800 
SGAFYTLLQDQVRNSYYIGADGSLMjLLJ^GMEVALQTEPHLIAGTVNPTVGKRNVTLPI 1788 

lllillllllllllllllllllllllllllllllllllllllllllllllllllllllll 

SGAFYTLLQDQVRNSYYIGADGSLRLLIJ^GMEVALQTEPHLLAGTVNPTVGKRNVTLPI 1860 
DNGLNLVEWRQRKEQARGQVTVFGRRLRVHNRNLLSLDFDRVTRTEKIYDDHRKFTLRIL 1848 

llllllllllllllllllllllll IIIIIIIIIIMIIIIIIMMIIMIIIIIIIII 

DNGLNLVEWRQRKEQARGQVTVFGPRLRVHNRNLLSLDFDRVTRTEKIYDDHRKFTLRIL 1920 
YDQAGRPSLWSPSSRLNGVNVTYSPGGYIAGIQRGIMSERMEYDQAGRITSRIFADGKTW 1908 

IIIIMII lllllllllllllllllhllllllllllllllllllllllllllllll I 

YDQAGRPSFWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRITSRIFADGKMW 1980 
SYTYLEKSMVLLLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEG 1968 

lllllllllll IIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIMIIIIIIIIII 

SYTYLEKSMVLHLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEG 2040 
NASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTI 2028 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIh 

NASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTV 2100 



llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 



IDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKEVQYEIFRSLMY 2148 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII|:||||||IMIIIIM mill 

IDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHSKHFDAYGRMKEVQYETFRSLMY 2220 
WMTVQYDNMGRWKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLH 2208 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 

WMTVQYDNMGRWKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLH 2280 



lllllllllllllllhllllllllllllllllll llllllhllllllllllll 



GSWSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLYNHSSSEITSLYYDLQGH 2328 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIhllllllMMMIMI 

SGWSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLYSHSSSEITSLYYDLQGH 2400 
LFAMELSSGDEFYIACDNIGTPLAVFSGTGLMIKQILYTAYGEIYMDTNPNFQIIIGYHG 2388 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

LFAMELSSGDEFYIACDNIGTPLAVFSGTGLMIKQILYTAYGEIYMDTNPNFQIIIGYHG 2460 

GLYDPLTKLVHMGRRDYDVLAGRWTSPDHELWKHLSSSNVMPFNLYMFKNNNPISNSQDI 2448 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII IM::^:|hllllllllllll|||| 
GLYDPLTKLVHMGRRDYDVLAGRWTSPDHELWKRLSSNSIVPFHLYMFKNNNPISNSQDI 2520 

KCFMTDVNSWLLTFGFQLHNVIPGYPKPDMDAMEPSYELIHTQMKTQEWDNSKSILGVQC 2508 

IIIIIIIMIIIIIIIIIIIIIIIIIIII llllllllhlllllllllllllllllill 

KCFMTDVNSWLLTFGFQLHNVIPGYPKPDTDAMEPSYELVHTQMKTQEWDNSKSILGVQC 2580 
EVQKQLKAFVTLERFDQLYGSTITSCQQAPKTKKFASSGSVFGKGVKFALKDGRVTTDII 2568 

IIIIIIMIIIIIIIIIIIIIIIIiMllhllllllllhlllllllllllllllllll 

EVQKQLKAFVTLERFDQLYGSTITSCQQAPETKKFASSGSIFGKGVKFALKDGRVTTDII 2640 
SVANEDGRRVAAILNHAHYLENLHFTIDGVDTHYFVKPGPSEGDLAILGLSGGRRTLENG 2628 

llllllllhllllhllllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

SVANEDGRRIAAILNNAHYLENLHFTIDGVDTHYFVKPGPSEGDLAILGLSGGRRTLENG 2700 
VNVTVSQINTVLNGRTRRYTDIQLQYGALCLNTRYGTTLDEEKARVLELARQRAVRQAWA 2688 

lllllllllhhIIIIIIMIIIII llllllllllhllll lllillllllllllll 

VNVTVSQINTMLSGRTRRYTDIQLQYRALCLNTRYGTTVDEEKVRVLELARQRAVRQAWA 2760 
REQQRLREGEEGLRAWTEGEKQQVLSTGRVQGYDGFFVISVEQYPELSDSANNIHFMRQS 2748 

llllllllllllllllhllllllhllllllllllll lllllllllllllllllllll 

REQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNIHFMRQS 2820 
2821 EMGRR 2825 



